    Challenges and solutions for Latin named entity recognition

    Although spanning thousands of years and genres as diverse as liturgy, historiography, lyric and other forms of prose and poetry, the body of Latin texts is still relatively sparse compared to English. Data sparsity in Latin presents a number of challenges for traditional Named Entity Recognition techniques. Solving such challenges and enabling reliable Named Entity Recognition in Latin texts can facilitate many downstream applications, from machine translation to digital historiography, enabling classicists, historians, and archaeologists, for instance, to track the relationships of historical persons, places, and groups on a large scale. This paper presents the first annotated corpus for evaluating Named Entity Recognition in Latin, as well as a fully supervised model that achieves over 90% F-score on a held-out test set, significantly outperforming a competitive baseline. We also present a novel active learning strategy that predicts how many and which sentences need to be annotated for named entities in order to attain a specified degree of accuracy when recognizing named entities automatically in a given text. This maximizes the productivity of annotators while simultaneously controlling quality.

    Estimation of the capacitor voltages in flying capacitor multi-level converters

    Flying capacitor multi-level converters use charged capacitors as a critical element. For proper operation, the capacitor voltages must be known and regulated. Direct measurement is straightforward, but it becomes complicated when the number of required measurements is high. This paper introduces an estimation method for n capacitor voltages. The scheme is based on a system of (n + 1) equations, defined so that it incorporates information on the actual capacitor voltages together with open-loop estimates of the capacitor voltages. The solution of the system gives the best estimates of the capacitor voltages in the least-squares sense. The method requires only output voltage and output current sensors and its computational burden is low, saving cost and design effort and simplifying the hardware. The performance of the proposed scheme is evaluated experimentally in a 9-level flying capacitor chopper, where the estimates are used by the voltage balancing strategy and by the output current controller.
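
As an illustration of the least-squares idea in this abstract, the sketch below stacks n open-loop estimates with one measurement equation, giving n + 1 equations for n unknown capacitor voltages. The abstract does not state the paper's actual equations, so several things here are assumptions made for illustration: the form of the measurement equation (a weighted sum of capacitor voltages equal to a measured quantity), the equal weighting of all equations, and the names `least_squares_update`, `open_loop`, `coeffs`, and `measured`.

```python
def least_squares_update(open_loop, coeffs, measured):
    """Least-squares solution of the stacked system

        x_i            = open_loop[i]   for i = 0 .. n-1   (n equations)
        sum(c_i * x_i) = measured                          (1 equation)

    With equal weights, the normal equations (I + c c^T) x = open_loop + c * measured
    have the closed-form solution used below (Sherman-Morrison identity).
    """
    # Mismatch between the measurement and what the open-loop estimates predict.
    residual = measured - sum(c * v for c, v in zip(coeffs, open_loop))
    # Share of the mismatch assigned to each unit of coefficient.
    gain = residual / (1.0 + sum(c * c for c in coeffs))
    return [v + c * gain for c, v in zip(coeffs, open_loop)]


# Hypothetical numbers: three capacitors, measurement sees (v1 - v2).
estimates = least_squares_update([100.0, 200.0, 300.0], [1.0, -1.0, 0.0], -94.0)
```

In this toy case the open-loop values predict a measurement of -100 while -94 is observed, so the correction is split between the two capacitors that appear in the measurement equation and the third estimate is left unchanged. A practical estimator would likely weight the measurement more heavily than the open-loop values.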

    Fifteen new risk loci for coronary artery disease highlight arterial-wall-specific mechanisms

    Coronary artery disease (CAD) is a leading cause of morbidity and mortality worldwide. Although 58 genomic regions have been associated with CAD thus far, most of the heritability is unexplained, indicating that additional susceptibility loci await identification. An efficient discovery strategy may be larger-scale evaluation of promising associations suggested by genome-wide association studies (GWAS). Hence, we genotyped 56,309 participants using a targeted gene array derived from earlier GWAS results and performed meta-analysis of results with 194,427 participants previously genotyped, totaling 88,192 CAD cases and 162,544 controls. We identified 25 new SNP-CAD associations ($P < 5 \times 10^{-8}$, in fixed-effects meta-analysis) from 15 genomic regions, including SNPs in or near genes involved in cellular adhesion, leukocyte migration and atherosclerosis (PECAM1, rs1867624), coagulation and inflammation (PROCR, rs867186 (p.Ser219Gly)) and vascular smooth muscle cell differentiation (LMOD1, rs2820315). Correlation of these regions with cell-type-specific gene expression and plasma protein levels sheds light on potential disease mechanisms.

    Genome-Wide Association Analysis of Incident Coronary Heart Disease (CHD) in African Americans: A Short Report

    African Americans have the highest rate of mortality due to coronary heart disease (CHD). Although multiple loci influencing CHD risk have been identified in European-Americans using a genome-wide association study (GWAS) approach, no GWAS of incident CHD has been reported for African Americans. We performed a GWAS for incident CHD events collected during 19 years of follow-up in 2,905 African Americans from the Atherosclerosis Risk in Communities (ARIC) study. We identified a genome-wide significant SNP (rs1859023, MAF = 31%) located at 7q21 near the PFTK1 gene (HR = 0.57, 95% CI 0.46 to 0.69, $p = 1.86 \times 10^{-8}$), which replicated in an independent sample of over 8,000 African American women from the Women's Health Initiative (WHI) (HR = 0.81, 95% CI 0.70 to 0.93, p = 0.005). PFTK1 encodes a serine/threonine-protein kinase, PFTAIRE-1, that acts as a cyclin-dependent kinase regulating cell cycle progression and cell proliferation. This is the first incident CHD locus identified by GWAS in African Americans.

    Energy Estimation of Cosmic Rays with the Engineering Radio Array of the Pierre Auger Observatory

    The Auger Engineering Radio Array (AERA) is part of the Pierre Auger Observatory and is used to detect the radio emission of cosmic-ray air showers. These observations are compared to the data of the surface detector stations of the Observatory, which provide well-calibrated information on the cosmic-ray energies and arrival directions. The response of the radio stations in the 30 to 80 MHz regime has been thoroughly calibrated to enable the reconstruction of the incoming electric field. For the latter, the energy deposit per area is determined from the radio pulses at each observer position and is interpolated using a two-dimensional function that takes into account signal asymmetries due to interference between the geomagnetic and charge-excess emission components. The spatial integral over the signal distribution gives a direct measurement of the energy transferred from the primary cosmic ray into radio emission in the AERA frequency range. We measure 15.8 MeV of radiation energy for a 1 EeV air shower arriving perpendicularly to the geomagnetic field. This radiation energy -- corrected for geometrical effects -- is used as a cosmic-ray energy estimator. Performing an absolute energy calibration against the surface-detector information, we observe that this radio-energy estimator scales quadratically with the cosmic-ray energy as expected for coherent emission. We find an energy resolution of the radio reconstruction of 22% for the data set and 17% for a high-quality subset containing only events with at least five radio stations with signal.

    Measurement of the Radiation Energy in the Radio Signal of Extensive Air Showers as a Universal Estimator of Cosmic-Ray Energy

    We measure the energy emitted by extensive air showers in the form of radio emission in the frequency range from 30 to 80 MHz. Exploiting the accurate energy scale of the Pierre Auger Observatory, we obtain a radiation energy of $15.8 \pm 0.7\,\text{(stat)} \pm 6.7\,\text{(sys)}$ MeV for cosmic rays with an energy of 1 EeV arriving perpendicularly to a geomagnetic field of 0.24 G, scaling quadratically with the cosmic-ray energy. A comparison with predictions from state-of-the-art first-principle calculations shows agreement with our measurement. The radiation energy provides direct access to the calorimetric energy in the electromagnetic cascade of extensive air showers. Comparison with our result thus allows the direct calibration of any cosmic-ray radio detector against the well-established energy scale of the Pierre Auger Observatory.
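
The quadratic scaling quoted in this abstract can be written as a small conversion, shown here as a sketch rather than the collaboration's reconstruction code. It uses only the central value of 15.8 MeV at 1 EeV and assumes the reference geometry from the abstract (arrival perpendicular to a 0.24 G geomagnetic field); uncertainties and the geometrical corrections are ignored, and the function names are made up for this example.

```python
import math

# Central value from the abstract: radiation energy (MeV) of a 1 EeV shower
# arriving perpendicularly to a 0.24 G geomagnetic field.
REFERENCE_RADIATION_ENERGY_MEV = 15.8


def radiation_energy_mev(cosmic_ray_energy_eev):
    """Radiation energy in MeV, assuming pure quadratic scaling with energy."""
    return REFERENCE_RADIATION_ENERGY_MEV * cosmic_ray_energy_eev ** 2


def cosmic_ray_energy_eev(radiation_energy):
    """Inverse relation: cosmic-ray energy in EeV from radiation energy in MeV."""
    return math.sqrt(radiation_energy / REFERENCE_RADIATION_ENERGY_MEV)


# Doubling the cosmic-ray energy quadruples the radiation energy.
energy_at_2_eev = radiation_energy_mev(2.0)
```

Because the relation is quadratic, a given relative error on the radiation energy translates into about half that relative error on the inferred cosmic-ray energy.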

    Measurement of the cosmic ray spectrum above $4{\times}10^{18}$ eV using inclined events detected with the Pierre Auger Observatory

    A measurement of the cosmic-ray spectrum for energies exceeding $4{\times}10^{18}$ eV is presented, which is based on the analysis of showers with zenith angles greater than $60^{\circ}$ detected with the Pierre Auger Observatory between 1 January 2004 and 31 December 2013. The measured spectrum confirms a flux suppression at the highest energies. Above $5.3{\times}10^{18}$ eV, the "ankle", the flux can be described by a power law $E^{-\gamma}$ with index $\gamma=2.70 \pm 0.02\,\text{(stat)} \pm 0.1\,\text{(sys)}$ followed by a smooth suppression region. For the energy ($E_\text{s}$) at which the spectral flux has fallen to one-half of its extrapolated value in the absence of suppression, we find $E_\text{s}=(5.12\pm0.25\,\text{(stat)}^{+1.0}_{-1.2}\,\text{(sys)}){\times}10^{19}$ eV.
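
To make the quoted spectral index concrete, the sketch below evaluates how steeply a pure power law with index 2.70 falls with energy. It is an illustration only: it applies to the power-law region above the ankle, ignores the suppression at the highest energies and the quoted uncertainties, and the function name `flux_ratio` is chosen here for the example.

```python
# Central value of the spectral index quoted in the abstract.
SPECTRAL_INDEX = 2.70


def flux_ratio(energy_ratio, index=SPECTRAL_INDEX):
    """Ratio J(E2) / J(E1) for a pure power law J ~ E^(-index).

    energy_ratio is E2 / E1; both energies are assumed to lie inside the
    power-law region, below the onset of the suppression.
    """
    return energy_ratio ** (-index)


# Flux at twice the energy, relative to the flux at the original energy.
ratio_when_doubling = flux_ratio(2.0)
```

Under these assumptions, doubling the energy reduces the flux to roughly 15% of its previous value.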

    Coding Variation in ANGPTL4, LPL, and SVEP1 and the Risk of Coronary Disease.

    BACKGROUND: The discovery of low-frequency coding variants affecting the risk of coronary artery disease has facilitated the identification of therapeutic targets. METHODS: Through DNA genotyping, we tested 54,003 coding-sequence variants covering 13,715 human genes in up to 72,868 patients with coronary artery disease and 120,770 controls who did not have coronary artery disease. Through DNA sequencing, we studied the effects of loss-of-function mutations in selected genes. RESULTS: We confirmed previously observed significant associations between coronary artery disease and low-frequency missense variants in the genes LPA and PCSK9. We also found significant associations between coronary artery disease and low-frequency missense variants in the genes SVEP1 (p.D2702G; minor-allele frequency, 3.60%; odds ratio for disease, 1.14; $P=4.2{\times}10^{-10}$) and ANGPTL4 (p.E40K; minor-allele frequency, 2.01%; odds ratio, 0.86; $P=4.0{\times}10^{-8}$), which encodes angiopoietin-like 4. Through sequencing of ANGPTL4, we identified 9 carriers of loss-of-function mutations among 6924 patients with myocardial infarction, as compared with 19 carriers among 6834 controls (odds ratio, 0.47; P=0.04); carriers of ANGPTL4 loss-of-function alleles had triglyceride levels that were 35% lower than the levels among persons who did not carry a loss-of-function allele (P=0.003). ANGPTL4 inhibits lipoprotein lipase; we therefore searched for mutations in LPL and identified a loss-of-function variant that was associated with an increased risk of coronary artery disease (p.D36N; minor-allele frequency, 1.9%; odds ratio, 1.13; $P=2.0{\times}10^{-4}$) and a gain-of-function variant that was associated with protection from coronary artery disease (p.S447*; minor-allele frequency, 9.9%; odds ratio, 0.94; $P=2.5{\times}10^{-7}$). CONCLUSIONS: We found that carriers of loss-of-function mutations in ANGPTL4 had triglyceride levels that were lower than those among noncarriers; these mutations were also associated with protection from coronary artery disease. (Funded by the National Institutes of Health and others.)
    Supported by a career development award from the National Heart, Lung, and Blood Institute, National Institutes of Health (NIH) (K08HL114642 to Dr. Stitziel) and by the Foundation for Barnes–Jewish Hospital. Dr. Peloso is supported by the National Heart, Lung, and Blood Institute of the NIH (award number K01HL125751). Dr. Kathiresan is supported by a Research Scholar award from the Massachusetts General Hospital, the Donovan Family Foundation, grants from the NIH (R01HL107816 and R01HL127564), a grant from Fondation Leducq, and an investigator-initiated grant from Merck. Dr. Merlini was supported by a grant from the Italian Ministry of Health (RFPS-2007-3-644382). Drs. Ardissino and Marziliano were supported by Regione Emilia Romagna Area 1 Grants. Drs. Farrall and Watkins acknowledge the support of the Wellcome Trust core award (090532/Z/09/Z), the British Heart Foundation (BHF) Centre of Research Excellence. Dr. Schick is supported in part by a grant from the National Cancer Institute (R25CA094880). Dr. Goel acknowledges EU FP7 & Wellcome Trust Institutional strategic support fund. Dr. Deloukas’s work forms part of the research themes contributing to the translational research portfolio of Barts Cardiovascular Biomedical Research Unit, which is supported and funded by the National Institute for Health Research (NIHR). Drs. Webb and Samani are funded by the British Heart Foundation, and Dr. Samani is an NIHR Senior Investigator. Dr. Masca was supported by the NIHR Leicester Cardiovascular Biomedical Research Unit (BRU), and this work forms part of the portfolio of research supported by the BRU. Dr. Won was supported by a postdoctoral award from the American Heart Association (15POST23280019). Dr. McCarthy is a Wellcome Trust Senior Investigator (098381) and an NIHR Senior Investigator. Dr. Danesh is a British Heart Foundation Professor, European Research Council Senior Investigator, and NIHR Senior Investigator. Drs. Erdmann, Webb, Samani, and Schunkert are supported by the FP7 European Union project CVgenes@target (261123) and the Fondation Leducq (CADgenomics, 12CVD02). Drs. Erdmann and Schunkert are also supported by the German Federal Ministry of Education and Research e:Med program (e:AtheroSysMed and sysINFLAME), and Deutsche Forschungsgemeinschaft cluster of excellence “Inflammation at Interfaces” and SFB 1123. Dr. Kessler received a DZHK Rotation Grant. The analysis was funded, in part, by a Programme Grant from the BHF (RG/14/5/30893 to Dr. Deloukas). Additional funding is listed in the Supplementary Appendix. This is the author accepted manuscript. The final version is available from the Massachusetts Medical Society via http://dx.doi.org/10.1056/NEJMoa150765
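
The ANGPTL4 carrier counts quoted in this abstract can be checked with a crude 2×2 odds ratio, sketched below. This is an unadjusted calculation written for illustration; the abstract does not say which statistical model produced the reported estimate, so agreement with the published 0.47 should be read as a consistency check, not as a reproduction of the authors' analysis. The function name `crude_odds_ratio` is chosen here for the example.

```python
def crude_odds_ratio(case_carriers, case_total, control_carriers, control_total):
    """Unadjusted odds ratio from a 2x2 table of carriers versus non-carriers."""
    case_noncarriers = case_total - case_carriers
    control_noncarriers = control_total - control_carriers
    # Odds of carrying a variant among cases, divided by the odds among controls.
    return (case_carriers * control_noncarriers) / (control_carriers * case_noncarriers)


# Counts from the abstract: 9 carriers among 6924 patients with myocardial
# infarction, 19 carriers among 6834 controls.
angptl4_or = crude_odds_ratio(9, 6924, 19, 6834)
```

Rounded to two decimals, the crude value is 0.47, in line with the odds ratio reported in the abstract.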